Automatic voice-source parameterization of natural speech

نویسندگان

  • Javier Pérez
  • Antonio Bonafonte
چکیده

We present here our work in automatic parameterization of natural speech by means of a pitch synchronous source-filter decomposition algorithm. The derivative glottal source is modelled using the Liljencrants-Fant (LF) model. The model parameters are obtained simultaneously with the coefficients of an all-pole filter representing the vocal tract response by means of a quadratic programming algorithm. Synthetic data has been created and analyzed in order to show the appropriate function of the estimation method. The parameterization results in high quality synthesized speech for voiced frames. Voice quality extraction is performed on basis to the LF source representation. The inherent modelling of the voice source makes it suitable for voice modification tasks. Work is in progress to add this speech representation to emotional speech synthesis and voice conversion algorithms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing methods for automatic extraction of voice source parameters from continuous speech

Two methods are presented for automatic calculation of the voice source parameters from continuous speech. Both methods are used to calculate the voice source parameters for natural speech. However, for natural speech no objective test proce dure seems available. Therefore, both methods were also tested on synthetic speech.

متن کامل

A new speech synthesis system based on the ARX speech production model

In this paper, we present a new formant-type speech analysissynthesis system based on the ARX (Auto-Regressive with Exogenous Input) speech production model. The model consists of cascade formant-antiformant synthesizers driven by a voicing source and an unvoiced turbulent noise source. One of the key features of the proposed method is that we have an algorithm to automatically measure the voic...

متن کامل

On the relation between voice source parameters and prosodic features in connected speech

The behaviour of the voice source characteristics in connected speech was studied. Voice source parameters were obtained by automatic inverse filtering, followed by automatic fitting of a glottal waveform model to the data. Consistent relations between voice source parameters and prosodic features were observed.

متن کامل

Fitting a LF-model to inverse filter signals

A method is presented for the automatic extraction of voice source parameters from speech. An automatic i n-verse filtering algorithm is used to obtain an esti mate of the glottal flow signal. Subsequently, an LF-model [1] is fitted to the glottal flow signal. In the current article we will focus on the improvement of the automatic fit procedure. To keep track of the performance of the fit proc...

متن کامل

Natural Language Processing techniques in Text-To-Speech synthesis and Automatic Speech Recognition

This working paper depicts the usage of Natural Language Processing techniques in the production of voice from an input text, a.k.a. Text-To-Speech synthesis, and the inverse process, which is the production of a written text transcription from an input voice utterance, a.k.a. Automatic Speech Recognition.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005